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ABSTRACT 

We investigate the clustering of galaxies selected in the 3.6 (a.m band of the Spitzer Wide-area 
Infrared Extragalactic (SWIRE) legacy survey. The angular two-point correlation function is 
calculated for eleven samples with flux limits of 6*3 g ^ 4^00 (Jy, over an 8 square degree 
field. The angular clustering strength is measured at > S-cr significance at all flux limits, with 
amplitudes of A = (0.49-29) x 10~^ at one degree, for a power-law model, A9^^-^. We 
estimate the redshift distributions of the samples using phenomological models, simulations 
and photometric redshifts, and so derive the spatial correlation lengths. We compare our re- 
sults with the GallCS (Galaxies In Cosmological Simulations) models of galaxy evolution 
and with parameterized models of clustering evolution. The GallCS simulations are consis- 
tent with our angular correlation functions, but fail to match the spatial clustering inferred 
from the phenomological models or the photometric redshifts. We find that the uncertainties 
in the redshift distributions of our samples dominate the statistical errors in our estimates of 
the spatial clustering. At low redshifts (median z ^ 0.5) the comoving correlation length is 
approximately constant, rg = 6.1 ± 0.5h^^ Mpc, and then decreases with increasing redshift 
to a value of 2.9 ± 0.3/i~^ Mpc for the faintest sample, for which the median redshift is z ~ 1. 
We suggest that this trend can be attributed to a decrease in the average galaxy and halo mass 
in the fainter flux-limited samples, corresponding to changes in the relative numbers of early- 
and late-type galaxies. However, we cannot rule out strong evolution of the correlation length 
over 0.5 < z < 1. 

Key words: galaxies; evolution - galaxies: statistics - infrared: galaxies - large-scale struc- 
ture of universe 



1 INTRODUCTION 

Galaxies are not distributed randomly across the sky. At least at low 
redshifts, they appear to trace distinct patterns: galaxy clusters are 
connected to each other by long, filamentary structures of galax- 
ies, interspersed with large voids in which few or no galaxies are 
seen. A plausible theoretical motivation has arisen for the forma- 
tion of such large-scale structures (LSS) of galaxies, namely that 
the galaxies are tracing an underlying distribution of dark matter. 
In their most modern form, models for the formation of these large 
scale structures postulate that the evolution of the dark matter den- 



sity field is inextricably link ed to the format i on and evolution of 
the galaxies thems elves (e.g.. lCole et alj200ulGranato et al.l2OO0l : 
iHatton et al.ll2003h . These models generally invoke some variation 
of the biased hierarchical paradigm, in which overdensities, or 'ha- 
los', in the dark matter distribution undergo successive mergers 
over time to build halos of increasing mass, with galaxies forming 
from the baryonic matter in these halos. 

From these models, and recent observations, it is clear that the 
relationship between the properties of galaxies, and the properties 
of the dark matter halos in which they reside, is subtle, and is an 
area in which observational constraints are particularly valuable in 
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constraining models. We would like to know, observationally, what 
sort of galaxy occupies what sort of halo as a function of redshift, 
and how the properties of galaxies change with both redshift and 
the masses of their parent halos. One method that has proven espe- 
cially useful in providing such observational constraints is measur- 
ing clustering amplitudes. Fundamentally, the biased hierarchical 
paradigm requires that overdensities of dark matter should them- 
selves cluster together on the sky, wit h the strength o f clustering de- 
pending on their mass (K aiser 1984: : lBardeen et alj|l986) . In prin- 
ciple, we can make great strides in understanding the relationship 
between galaxies and the underlying dark matter distribution by 
measuring the clustering strength of galaxies selected in a particular 
way, and relat ing this to theoretical predictions for halo clustering 
( lBensonet"ai]|2001.) . 

Such observations are, however, not straightforward to per- 
form. Reliable clustering measurements require large, homoge- 
neous samples of sources selected over large enough areas of sky 
to sample a range of dark matter density regimes. Ideally, we 
would like such observations to be performed in the near- and mid- 
infrared: the (restframe) near-infrard is most sensitive to evolved 
stars and so can pick up large samples of passively evolving sys- 
tems, whereas the mid-infrared is sensitive to the dusty, active 
sources in which the stars and central black holes in (at least some) 
passively evolving systems are thought to form. Infrared observa- 
tories available up to now, however, have not been capable of map- 
ping large enough areas to the required depths to find sufficient 
numbers of sources, or in enough bands to even crudely discrimi- 
nate between different populations. 

^The launch of the Spitzer Space Telescope jWemer et al.l 

1 2004) offers the potential to overcome these problems, due to its 
ability to map large areas of sky in the infrared to greater depths 
than any previous observatory, and in multiple bands so that dusty, 
active systems can be differentiated from passively evolving sys- 
tems. The Spitzer Wide-area Infrared Extragalactic (SWIRE) sur- 
vey (Lonsdale et al. 2003, 2004) is the largest of the Spitzer Space 
Telescope's six Cycle 1 legacy programmes. The survey covers 
a total area of 49 square degrees, split between six fields, in all 
seven of Spitzer's imaging bands (3.6-160 (xm). The area and depth 
of SWIRE combine to produce a survey of significant comoving 
volume, 0.2h^'^ Gpc'^ over < 2 < 2, and spatial scales of 
~ lOO/i"^ Mpc at 2 ^ 1. 

A principal goal of SWIRE is to study t he clustering behav iour 
of a variety of extragalactic populations. In I Oliver et alj ( 120041) we 
presented the first detection of galaxy clustering in the survey, mea- 
suring a two-point angul ar correlation func tion at 3.6 \im from 
our validation data, and in lParrah et al.l J2006I) we presented results 
on the clustering of Ultr aluminous Infrared Galaxies (ULIRGs) at 
z > l. lFang et alj ( l2004r) presented angular correlation functions at 
3.6-8.0 (xm from the 4-square degree Spitzer First Look Survey. 

In this paper we extend the analysis of the 3.6-|J.m clus- 
tering to larger scales and fainter flux limits (higher redshifts). 
We begin with a summary of definitions and formalisms in sec- 
tion [2] then in section [3] we discuss the sample selection, includ- 
ing star/galaxy separation and the angular selection function. In 
section |4] we present our measurements of the two-point angular 
correlation function. We compare our results with previous mea- 
surements in the K-band (section [5j and with the GallCS semi- 
analytical simulations (section [SJ. The angular clustering ampli- 
tudes are used to estimate the spatial correlation lengths, which 
are then compared with simple parameterized models of clustering 
evolution, the GallCS simulations and results from the literature 
(section I?}. Section [8] draws together some conclusions from our 



analysis. We use Hq = lOO/i' km s ^ Mpc~^ with Qm = 0.3 
and f^A =0.7. Magnitudes are in the AB system unless otherwise 
noted. 



2 DEFINITIONS AND LIMBER'S EQUATION 

The spatial two-point correlation function ^(r, z) is defined through 
the joint probablility 



dP{r,z) = N^[l + ^(r,z)]dVidV2 



(1) 



of finding a galaxy in the volume element dVi and a second 
galaxy in the volume element dV2 separated by a distance r at 
a red shift z, where N(z) is the mean number density of sources 
(e.g.. |PhilliDPS et alJI 19781) . In comoving coordinates, the correla- 
tion function can be parameterized as 



C(r,z)= - 



(l + z 



,7-(3 + <!) 



(2) 



where ro measures the strength of the clustering at 2 = 0, 7 mea- 
sures the scal e-dependence and e parameterizes the evo lution with 
redshift (e.g.. |Phillipps et al.lll978l : IOverzier et al.ll2003h . 

Several special values of e have particular interpretations. 

(1) e = is the stable clustering model, where the correlation func- 
tion is fixed in proper coordinates and clustering grows stronger 
as the background mass distribution expands with the universe. 

(2) e — 7 — 3 is the comoving case, where clustering remains 
constant in comoving coordinates and simply expands with the uni- 
verse. (3) e = 7 — 1 is the linear growth model, which corresponds 
to the application of linear perturbation theory to a scale-free power 
spectrum in an Einstein-de Sitter universe. We note that these mod- 
els are qualitative indicators of possible evolution scena rios, rather 
than realistic clustering models dMoscardini et aljl998h . 

The angular two-point correlation function w{8) is a measure 
of the number of pairs of galaxies with separation 9 compared with 
that expected for a random distribution. It is defined through the 
joint probability 



dP{e) ^ Nq[i + w{e)]dnidn2 



(3) 



of finding a galaxy in solid angle df^i and a second galaxy in solid 
angle d^2 separated by an angle 0, where Nq is the mean number 
density of sources (per steradian) in the survey (e.g., Phillipps et alj 
Il978l) . If ui(6') is zero, the distribution of galaxies is unclustered. 

The angular correlation function, w{9), is the projection along 
the line of sight of the spatial correlat ion function, ^(r, z), and can 
be calculated from Limber's equation (lLimbeiil953luPhillipps et al.l 
1978). If ^(r, z) is parameterized as a power-law, as above, then 
w{9) is also a power-law 



w{e) = AB 



1-7 



(4) 



The amplitude. A, of t he angular corr elation function can be ex- 
pressed (following, e.g.. lEfstathiou et al._1991.) as 

^^rJl ir H{z) (1 + 2)-'^+-) D'r jdN/dzf dz ^^^ 

[j;^{dN/dz)dzY 



where 



/ = 



/¥r([7-i]/2) 
r(7/2) 



(6) 



with r being the standard gamma function. Here, dN/dz is the red- 
shift distribution. Da is the angular diameter distance, and H{z) is 
the Hubble parameter, 
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H{z) = Hoy/nM{i + zy-i + nk{i + zY + n. 



(7) 



(where we have neglected the radiation energy density). The spatial 
correlation length, ro, can thus be calculated from the amplitude of 
the angular correlation function (eq. [5]( if the redshift distribution, 
dN/dz, of the sources in the survey is known. 



3 SAMPLE SELECTION 

3.1 Spitzer observations 

The SWIRE-EN 1 field has an area of ~9 deg'^ and is coincident 
with one of five fields observed as part of the Europea n Large- Area 
ISO ( Infrared Space Observatory) Survey, ELAIS JOliver et al.l 
uOOd : iRowan-Robinson et alj 120041) . The nominal field centre is 
le'^ll^OO" -h55°00'00" (12000). The field was mapped by 
Spitzer at 3 .6, 4.5, 5.8 and 8 .0 |J.m with the Infrared Array Cam- 
era (IRAC. iFazio et alj|2004h on 2004 January 14-20, and at 24, 
70 and 160 u.m w ith the Multiband Imaging Photometer (MIPS, 
iReike et aljr2004h on 2004 January 21-28 and 2004 July 29. The 
data can be retrieved from the Spitzer archive with a Program Iden- 
tification (PID) of 185, and the enhanced data products (image mo- 
saics and catalogues) are available from the Spitzer Science Cen- 
terQ 

Full detail s of the observations and data processing are given 
in ISurace et alj ilOOS ): here we summarize the essential details. 
The SWIRE-EN 1 field was mapped by IRAC with a large grid of 
pointings, and at each grid point two 30-second images were taken, 
each one consisting of multiple dithered exposures. The entire grid 
was repeated in two epochs, offset by half an array width. Thus, for 
any point on the sky there are a minimum of four independent sight- 
ings (images), and these sightings occur on widely spaced parts 
of the detector array in order to minimize instrumental signatures. 
The entire survey has a minimum depth of four coverages, equal to 
120 seconds of exposure time. In some areas this can be as high as 
sixteen coverages, or 480 seconds. 

The IRAC data were reduced and flux-calibrated by the Spitzer 
Science Center. Further processing of the individual images re- 
moved a number of effects (mostly due to bright objects, pri- 
maril y stars) that remained in the pipeline products JSurace et al.l 
120060 . The images were then coadded into sixteen large mosaics 
(or 'tiles') of approximately 0.8 x 0.8 square degrees each. Sources 
were detected and their photometry m easured with the S EXTRAC- 
TOR package (Bertin & Arnouts 1996). We used the flux measured 
in a circular aperture of 3'.'8 diameter, unless the source was signif- 
icantly extended, in which case we used the flux within the Kron 
aperture. The source catalogu e was a superset of the SWIRE Data 
Release 2 JSurace et alj2005l) , containing fainter objects than pub- 
lished at that time. 



3.2 Sample definition 

We analyzed eleven flux-limited samples selected at 3.6 |i.m from 
the SWIRE-EN 1 catalogue (Table \Q. The deepest sample con- 
tained sources with flux densities 5*3.6 ^ 4.0 viJy (or ttisb < 
22.4 mag), this limit being defined by the flux density at which 
the differential completeness is approximately 50 per cent. At this 
level, the integral completeness is 82 per cent (see Fig. [T] and dis- 
cussion in section 13. 3t . Flux intervals of A log S3.6 ~ 0.2 or 
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Table 1. Angular and spatial clustering strengths for each of the samples. 
S36 are the flux limits. A are the amplitudes of the angular correlation func- 
tions and AC are the integral constraints, (z) are the median redshifts and 
r'o are the spatial congelation lengths, derived from the GallCS redshift dis- 
tributions for the Sse = 4.0-15.9 |aJy samples and from the IMPZ redshift 
distributions for the brighter samples (see section|7). 



S36 


A 


AC 


(^> 


'"0 


HJy 


10-3 


10-3 




/i-l Mpc 


4.0 


0.49 ± 0.10 


0.42 ± 0.09 


1.00 


2.93 ±0.34 


6.3 


0.63 ±0.14 


0.54 ±0.12 


0.90 


3.18 ±0.38 


10.0 


0.98 ±0.14 


0.84 ±0.12 


0.80 


3.84 ±0.31 


15.9 


1.48 ±0.08 


1.26 ±0.07 


0.70 


4.48 ±0.14 


25.2 


1.90 ±0.17 


1.63 ±0.15 


0.65 


4.78 ±0.24 


40.0 


2.47 ±0.35 


2.12 ±0.30 


0.60 


5.28 ±0.41 


63.4 


4.73 ±0.19 


4.09 ±0.17 


0.50 


6.70 ±0.15 


100.5 


7.40 ± 0.29 


6.39 ± 0.25 


0.38 


6.47 ±0.14 


159.2 


11.31 ±0.93 


9.77 ±0.81 


0.28 


5.58 ±0.25 


252.4 


17.98 ± 1.90 


15.54 ±1.64 


0.20 


5.59 ±0.33 


400.0 


29.12 ±2.61 


25.16 ±2.25 


0.17 


6.15 ±0.31 



Am36 = 0.5 were used, corresponding to intervals of Az ~ 0.1 
in the median redshift of the samples (see section ITJI . 

The flux limit of the brightest sample was set by the need to 
have at least 1000 sources in order to measure the ampitude of the 
angular correlation function with more than 3-a significance. This 
limit on the minimum number of sources was determined empiri- 
cally, by calculating w{9) for different subsets of the data, varying 
both the total number of sources and the width of the angular bins 
in each subset. The brightest sample contained 1501 sources with 
S3.6 > 400.0 \xJy (or mge < 17.4 mag). 



3.3 Angular selection function 

In order to calculate clustering statistics, we require an angular se- 
lection function that would describe the distribution of sources in 
the survey if there was no clustering. This takes the form of a mask, 
where the value of the mask at each position is the relative proba- 
bility of finding a source at that location on the sky in the absence 
of clustering. This mask is then used to simulate a random (i.e., 
unclustered) catalogue of objects (section |4Tt . The probability of 
detecting a source at any given position depends on the complete- 
ness of the survey at that point, which is a function of the noise in 
the image. A noise map was calculated from the coverage map (i.e. 
a map of the integration time) and the completeness function was 
found from simulations, as follows. 

Coverage maps were generated for each individual mosaic, 
recording the number of independent images contributing to each 
pixel, after taking into account the complex dithering pattern and 
any missing data due to cosmic ray rejection. The combined map 
for the whole SWIRE-EN 1 field was rebinned by a factor of 5 to a 
pixel scale of 3 arcsec, reducing the size of the map so that it would 
fit into memory. This rebinning effectively smoothed the coverage 
map on a scale of 3 arcsec, closely matching the 3'.'8-diameter pho- 
tometric aperture. Pixels with mean coverage less than 2.95 were 
excluded from the mask (this allowed for up to one of the images 
in the full coverage areas to be flagged and rejected due to a cosmic 
ray). The noise in the mosaics scales with the integration time, t, as 
a oc l/\/t and so vari es with coverage (n umber of codded images), 
K, also as cr oc l/\/^ JSurace et alj|2005r) . The coverage mask was 
then used as a proxy for a noise map. 
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Figure 1. The integral completeness function at 3.6 |am. A'^(> 5) is the cu- 
mulative number of sources with fluxes ^ Ss.g, and No{> S) is similarly 
the cumulative number of sources after correction for incompleteness. The 
dotted lines show the flux limits of our samples. 



We calculated the completeness function by simulating artifi- 
cial sources and adding them into the SWIRE images. The source 
extraction stage of the analysis was then repeated, and the new 
source list was compared with the known positions and fluxes of 
the artificial sources. The fraction of simulated sources that were 
recovered by the source extraction was computed as a function of 
flux and coverage (noise). Fig. [T] shows the integral completeness 
as a function of flux limit, for an average coverage of 5.0 pointings. 
The survey is 99% complete at 22 ^ijy, 95% complete at 8.1 p.Jy, 
and the integral completeness falls to 82% for our faintest sample 
at 4.0 [i-Jy. The simulation results also confirmed that the complete- 
ness, /, scaled as expected with coverage, k, and flux limit, S, as 
f{S, k) — f{^/7iS, 1). At every point in the coverage map, we cal- 
culated the integral completeness given the flux limit and coverage. 
This was then the relative probability that a source in the survey 
could have been found at that location, in the absence of clustering. 

The final contribution to the angular selection function was 
to exclude circular regions around bright point sources . For this 
we u sed the Two-Micron All Sky Survey (2MASS, Skrut skie et al.l 
12006 ). masking a circle around all K ^ 12 point sources within 
a radius, R, given by log i?(arcsec) = 3.1 — 0.16 A'. This radius 
was determined by visual inspection of the SWIRE images to find 
the distance at which the star's PSF merges into the background. 
Note that these values are about a f actor of two larger than that 
used in the public SWIRE catalogue JSurace et al.ll2005r) , giving a 
more conservative mask. 



3.4 Star/galaxy separation 

The presence of stars in the source catalogue artificially dilutes the 
strength of the galaxy correlations, so it was necessary to remove 
the stars before performing the clustering analysis. To make the best 
use of the large survey area, we did not want to be restricted to the 
limited area with optical coverage, so we developed a procedure to 
remove stars using only infrared criteria. Our goal was to minimize 
the number of stars remaining in the sample, but without removing 
an excessive number of galaxies. 

We explored a range of magnitude, colour and stellarity selec- 
tion criteria in order to identify stars in the infrared data, develop- 



ing a three-stage process to remove them. First, the SWIRE cata- 
logue was cross-correlated with the 2MASS survey to classify the 
bright sources. Objects flagged as being extended in 2MASS are 
galaxies and bright {K ^ 14 mag) point sources are stars. Second, 
faint 2MASS sources were classified based on their near- to mid- 
infrared colours and stellarity at 3.6 [xm. Point-like (stellarity > 
0.94) sources are identified as stars. Sources that are both blue 
(J — mae < —1.50 or H — mze < —2.2) and not clearly resolved 
(stellarity > 0.06) are also stars. Third, stars fainter than 2MASS 
were classified based on their mid-infrared colours and 3.6 [xm stel- 
larity, where blue compact sources are stars. We used three flux 
bins (m45 ^ 19.5, 19.5 < m45 ^ 20.0 and 20.0 < rm^ ^ 23.0) 
with colour cuts of nise — 77145 < —0.35, —0.30 and —0.25, and 
stellarity > 0.8, 0.8 and 0.7 respectively to identify stars. 

A subset of the SWIRE-EN 1 field has optical imaging data 
which we used to estimate the effectiveness of our infrared 
star/gal axy separation. The Isaac New ton Tele scope Wide Angle 
Survey JMcMahon et alJl200ll : lGonzalez-Solares et al.ll2004 b) ob- 
served 6.4 deg^ of the SWIRE-EN 1 survey in five optical bands 
(U, g' , r' , i' , Z) to r' ~ 23.5 mag. We identified optical counter- 
parts to the SWIRE sources and selected those sources with high 
optical (i-band) stel larity according to the S EXTRACTOR source 
extraction software JBertin & Amoutslll996l) . These sources con- 
stituted a reference list of stars, against which we compared the 
infrared classifications. At bright fluxes (inse < 21) the stellar 
contamination in the galaxy sample was estimated to be < 3%, 
rising to 4% at mae — 22-23. Fainter than m-^a — 23, t he total star 
counts are < 3% of the galaxy counts JFazio et alj2004n so the con- 
tamination is still low, even though we can no longer identify stars 
at these faint magnitudes. We also compared our galaxy sample 
with a list of stars identified by the IMPZ p hotometric redshift esti - 
mation code (Babbedge et al. 2004 : Rowan-Robinson et al.ll2005h . 
Again we found that the stellar contamination in our infrared galaxy 
catalogue was only 2^% at m^a < 23 mag. 

At all fluxes, approximately 10% of the galaxies were rejected 
by our star selection criteria. There was a slight bias towards re- 
jecting blue (in near- to mid-infrared colours) compact galaxies, 
but the fraction of galaxies rejected was sufficiently small that this 
is not expected to significantly bias the measurement of the angular 
correlation function. 



4 ANGULAR CORRELATION FUNCTIONS 

4.1 Method 

The angular correlation function (Eq. |4ll was estimated by com- 
paring the distribution of galaxies in the survey with catalogues 
of random sources. The random catalogues were simulated using 
the angular selection function (Section 13. 3t , such that the angu- 
lar distribution of the random sources reflected the geometry and 
variable depth of the actual survey. We compared each real dataset 
with 1000 random catalogues, each containing the same number of 
sources as the galaxy catalogue, ensuring that the uncertainty in the 
correlation function was not dominated by the scatter between the 
random catalogues. 

A number of methods have been prop osed to calculate the an- 
gular correlation function; here we use the lLandv & Szalavl ( Il993h 
estimator. 



w{e)^ 



DD - 2DR + RR 
RR 



(8) 



where DD is the number of galaxy-galaxy pairs at separation 



Galaxy clustering at 3.6 ^m 5 



9, normalized by the total number of pairs over all separations; 
and DR and RR are similarly the normalized number of galaxy- 
random and random-random pairs respectively. 

The computationally-intensive step in the calculation is count- 
ing the number of pairs of sources, DP, DR and RR . For this we 
used the NPT pair-counting code of iGrav et alj J2004h which uses 
kd-trees to greatly accelerate the speed of the calculation compared 
with naive counting methods. We used twelve logarithmically- 
spaced bins in 6, over the range 0.001 < < 4.0 degrees, with 
A log 61 = 0.3. 



4.2 Error estimates and parameter fitting 

The large number of sources in the SWIRE dataset allowed us to 
calculate the errors on the correlation function by comparing sub- 
sets of the data. For each of the six faint flux-limited samples (4.0- 
40.0 (i-Jy) we divided the data into 9-25 subsamples of 10,000- 
15,000 sources each. Two sampling methods were used. First, we 
randomly selected galaxies across the full field; this gave good 
statistics on the larger scales (> 0.05 deg). Second, we obtained 
better statistics on smaller scales by dividing each flux-limited sam- 
ple into a grid of 9-25 smaller regions and calculating w{6) using 
all the sources within a sub-region. 

For each sub-sampling method, this produced n — 9-25 in- 
dependent estimates of the correlation function, Wi(6j), where i 
labels the subsample and j labels the angular bin. From these n 
estimates, we calculated the mean w{6j) = X/'-i UJi{9j)/n for 
each bin. The covariance between angular bins dj and dk is given 
by 



wn,{e)^A{e 
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C). 



(11) 



2 _ 



1 



1 ^^ 



[MSj) - ^{SM^^iSk) - w{Sk)]. 



(9) 



As other authors have also found (e.g.. IZehavi et alj|2002h . the 
off-diagonal terms become increasingly noisy for elements farther 
away from the diagonal, making inversion of the covariance ma- 
trix unstable, so we only retained the covariances between adjacent 
bins, i.e. we set a|;. = for |j — fc| > 1. 

The brighter flux-limited samples (63^00 ji-Jy) contained too 
few sources to divide into independent samples, so to calculate the 
errors we used the jackknife technique. Each sample was divided 
into a grid of 4 x 4 sub-areas and we calculated Wi{6j) sixteen 
times, each time excluding a different sub-area. The best estimate 
of the correlation function is the mean of the Wi {6j ) and the covari- 
ance is 



0"jfe - 



n- 1 



1=1 



IMSj) ~ ti^ieMM^k) - w{ek)] 



(10) 



dScranton et al.ll2002h . Again, we only retained the covariances be- 
tween adjacent bins. 

In Fig. |2] we plot the angular correlation functions for the 
eleven samples, corrected by the integral constraint discussed be- 
low. For the faint samples, we show the data from the full-field 
subsampling, which give the best results on large scales and are 
consistent on smaller scales with the results of the sub-region sam- 
pling. 

The angular correlation function is parameterized as a power 
law A9^^'' (Eq.|4]l, where the amplitude. A, measures the strength 
of the clustering, the index, 7, measures its scale-dependence, and 
9 is measured in degrees. Due to the finite size of the survey, the 
observed 'w{9) is a biased estimator of the real correlation function, 
and can be modeled as 



The integral constraint, AC, can be estimated by doubly integrat- 
ing the (assumed) true Wm(^) over the area of the survey, a calcu- 
lation that can be done numerically using the random-random pair 
counts. 



C = 



E,Nrri9j)9^ 



1-7 
J 



E,Nrr{9,) 



(12) 



where Nrr{9j) are the u nnormalized count s and the summation is 
over all the angular bins (Roche et al.lll999f) . 

We fitted the model (Eg. II It to the observed correlation func- 
tion w{9j) for each of the data samples by minimizing the general- 
ized x^, defined as 



i=i fc=i 



(13) 



where Hjk is th e inverse of the covariance matrix, (cr|j,)~^ 
jPoUo et al.ll2005l) . A wide range of optical and infrared surveys 
indicate that 7 = 1.8 and we found that this was consistent with 
the present data, so fixed 7 to this value for comparison with other 
surveys. The best-fitting models are shown with the data in Fig. |2] 
and the amplitudes and integral constraints are listed in Table[T] 



5 COMPARISON WITH K-BAND SURVEYS 

The angular correlation function is the projection along the line of 
sight of the spatial correlation function, ^(r, z), and is dependent 
on both the redshift distribution and luminosity function of galax- 
ies in the survey and on the evolution of the spatial clustering. At 
fainter flux limits the survey probes to higher redshifts (larger vol- 
umes) and lower luminosities, and both these reduce the strength 
of the projected clustering. This is shown in Fig. [3] where we plot 
the amplitude. A, of the angular correlation function against the 
limiting magnitude for a range of iC-band surveys. We compare 
the 3.6-|i.m SWIRE results with previous K-bznd data, due to the 
abundance of clustering measurements in K and the re latively few 
measurements at 3.6-vim (Fang et al. 2004 ; iOliver et al. 200 4). The 
emission in both the K and 3.6-p.m bands arises from the old stars 
in a galaxy - both bands are relatively insensitive to the current star 
formation rate and are good tracers of the stellar mass. 

We estimated the equivalent JC'-band limit (Vega system) for 
each of our 3.6 |i.m selected samples using the average K — mae 
colour of SWIRE galaxies detected in the Early Data Release o f 
the UKIRT Infrared Deep Sky Survey (UKIDSS; lD"ve et alj|2006l) . 
We overplot the SWIRE clustering amplitudes on Fig.|3] where the 
errors in A'-band magnitude correspond to the standard deviation 
of the K — jnse colour distributions. Our data are consistent with 
these other surveys, confirming that both bands are selecting sim- 
ilar galaxy populations, with our new results having significantly 
smaller errors in the amplitudes, due to the much larger survey area 
at fainter fluxes. The larger area of future UKIDSS data releases 
will enable us to reduce the uncertainty in the equivalent K-band 
limits of SWIRE, particularly at the bright end where there are rel- 
atively few sources in UKIDSS at present. 

Also plotted in Fig.[3]are three models of clusteri ng evolution 
(Eg. [2) , with e = 0, -0.4 & -1.2, from figure 7 of iRoche et al] 
( I2OO3I) . Following Eq. |5] the amplitude of the angular correlation 
function can be calc ulated if the redshift distr ibution (dN/dz) of 
the survey is known. iRoche et al.l ( 120021120030 predict the redshift 
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Figure 2. Angular correlation functions w{8) for the eleven samples with flux density limits of 4-400 liTy. Solid lines are the best-fitting power-law model with 
7 = 1.8, and the data have been connected for the integral constraint. 



distributions using a simple galaxy evolution model, where they 
evolve a i('-band galaxy luminosity function according to stellar 
population synthesis codes, given a star formation history and a 
galaxy merger rate. We see that the model that best fits the SWIRE 
data is their stable clustering model (e — 0), with their comoving 
model (e = —1.2) rejected at > 5-a at the faintest magnitudes 
(K ~ 21). We note that these models of the angular clustering are 
dependent on a range of param eters, not just the e volution of the 
spatial clustering, e, so it is the iRoche et al.l ( 120030 merger model 
with comoving evolution that is a poor fit to the data, and this does 
not imply that comoving clustering in general can be rejected. 



6 COMPARISON WITH GALICS MOCK CATALOGUES 

GallCS (Galaxies In Cosmological Simulations) is a hybrid model 
of galaxy evolution which combines high-resolution N-body simu- 
lations of the dark matter content of the universe with semi-analytic 
prescriptions to describe the fate of the baryons within the dark 
matter halos (Hatton et al. 2003). The simulations have 256"^ par- 
ticles of mass 8 x 10^ Mq, in a 100/i^^ Mpc box with a spatial 
resolution of 20/i~^ kpc. Within each halo, some fraction of the gas 
mass is cooled and turned into stars which then evolve. The spectral 
energy distributions of these model galaxies are computed by sum- 
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Table 2. Spatial con'elation lengths, tq (h ^ Mpc), and median redshifts, (2), derived from the SWIRE 
clustering amplitudes and each of the redshift distributions. Ssq are the flux hmits in |xly. 







Xu et al. 


Franceschini et al. 




GallCS 




IMPZ 


S'se 


(2> 


ro 


(^> 


ro 


(^) 


ro 


(^> 


ro 


4.0 


0.88 


2.85 ±0.33 


0.94 


2.95 ± 0.34 


1.00 


2.93 ±0.34 


0.68 


2.46 ±0.28 


6.3 


0.84 


3.18 ±0.38 


0.88 


3.22 ±0.38 


0.90 


3.18 ±0.38 


0.68 


2.79 ±0.33 


10.0 


0.78 


3.97 ±0.32 


0.82 


3.92 ±0.32 


0.80 


3.84 ±0.31 


0.68 


3.48 ±0.28 


15.9 


0.74 


4.82 ±0.15 


0.74 
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5.09 ±0.26 
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5.91 ±0.46 


0.62 


5.55 ±0.43 


0.50 


4.82 ± 0.37 


0.60 


5.28 ±0.41 


63.4 


0.58 


7.70 ±0.17 


0.56 


7.33 ±0.17 


0.40 


5.78 ±0.13 


0.50 


6.70 ±0.15 


100.5 


0.46 


8.52 ±0.19 


0.48 


8.27 ±0.18 


0.30 


5.87 ±0.13 


0.38 


6.47 ±0.14 


159.2 


0.36 


8.75 ±0.40 


0.36 


8.49 ±0.39 


0.25 


5.66 ± 0.26 


0.28 


5.58 ±0.25 


252.4 


0.26 


8.58 ±0.50 


0.24 


7.97 ±0.47 


0.20 


5.61 ±0.33 


0.20 


5.59 ±0.33 


400.0 


0.20 


8.51 ±0.42 
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6.15 ±0.31 
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Figure 3. The amplitude, A, of the angular coiTelation function, as a func- 
tion of i^-band limiting magnitude (Vega s\^tem), for our SWIRE data 
(solid circles) and surveys from the literature JBa ugh et al. 1996; Kong et al. 
2006|j Kummel & Wagneil2OO0HMcCrcaken et al.i2000i:iRoche et aljl99a 
19991 2002, 200^. The ii'-band limits of flie SWIRE data have been esti- 
mated from average K — mss colours (section|5). The lines are models of 
clustering evolution from .Roche et al. ( 2003,) . 



ming the contribution of all the stars they contain, tracking their 
age and metallicity. A mock catalogue is generated by projecting a 
cone through the simulation at a series of timesteps (redshifts), and 
calculating the properties of the galaxies 'observed' in the cone. 
The GallCS project have made availablqj these 1-deg cones, from 
which we have extracted mock catalogues of the SWIRE survey. 

We calculated the two-point angular correlation functions for 
the eight GallCS catalogues, each time using the eleven flux- 
limited samples corresponding to the flux limits listed in Table [T] 
We used the same method as for the SWIRE data (section |4j and 
calculated errors and covariances from the eight independent sam- 
ples. The correlation functions were fitted by a power-law model 
(Eg. lilt with fixed 7 = 1.8, to determine the amplitudes and the 
integral constraints. The results are given in Table|3] 

In Fig.|4]we compare the SWIRE correlation functions (data 
points) with the GallCS results, where the shaded regions are the 
one-sigma error bounds from the mock catalogues, and all datasets 
have been corrected for their integral constraints. At the brightest 



http://galics.cosmologie.fr/ 



Table 3. Angular and spatial clustering strengths for the GallCS simula- 
tions. S36 are the flux limits, A are the amplitudes of the angular coiTela- 
tion functions, AC are the integral constraints, (z) are the median redshifts 
and ro are the spatial correlation lengths. 



•S'ae 


A 


AC 


(^> 


ro 


laJy 


10-3 


10-3 




h-^ Mpc 


4.0 


0.77 ± 0.03 


1.56 ±0.06 


1.00 


3.76 ±0.08 


6.3 


0.87 ±0.05 


1.76 ±0.09 


0.90 


3.81 ±0.11 


10.0 


1.28 ±0.06 


2.57 ±0.12 


0.80 


4.44 ±0.11 


15.9 


1.50 ±0.13 


3.03 ±0.25 


0.70 


4.53 ±0.21 


25.2 


1.87 ±0.13 


3.77 ±0.25 


0.60 


4.67 ±0.17 


40.0 


2.58 ±0.14 


5.21 ±0.27 


0.50 


4.94 ±0.14 


63.4 


3.85 ±0.42 


7.76 ± 0.86 


0.40 


5.15 ±0.32 


100.5 


4.18 ±0.53 


8.43 ±1.07 


0.30 


4.27 ±0.30 


159.2 


10.67 ±2.07 


21.52 ±4.18 


0.25 


5.48 ±0.59 


252.4 


13.47 ±2.42 


27.16 ±4.88 


0.20 


4.78 ±0.48 


400.0 


15.92 ±4.06 


32.07 ±8.18 


0.15 


4.08 ±0.58 



flux limits {83.6 > 159 /.iJy), the large uncertainties in the model 
correlation functions, particularly on small scales, are due to the 
small size (1 sq. deg.) of the simulations compared with the data 
(8.1 sq. deg.). There are less than 1000 sources in each of these 
bright samples, and it is seen that this is insufficient to measure 
w{6) accurately at S < 0.01 deg. Similarly, the greater uncertain- 
ties in the models on large scales are due to the smaller angular size 
of the GallCS catalogues. The largest angular scale of the simula- 
tions is 1.4 deg compared with 4.1 deg for the data, although the 
measurement of wiO) becomes uncertain on scales much smaller 
than the maximum extent of either survey. 

We compared the GallCS correlation functions with the 
SWIRE results using a y^ test. If the two smallest scale bins are ex- 
cluded from the comparison, then the data and simulations do not 
differ significantly. Even for the faintest two samples (4 and 6 |i.Jy), 
where the simulations lie consistently above the data in Fig. |4] the 
difference is not statistically significant. However the small-scale 
discrepencies between the simulations and the data are worth not- 
ing. The GallCS correlation functions deviate from a power law at 
scales < 30 arcsec at all flux limits for which there is good data 
on small scales. The x^ test gives the probability that the data and 
simulations are drawn from the same distribution as only 10^^- 
lO"* for these small scales, for almost all samples brighter than 
536 ^ 15 nJy. 

This lack of close (< lOO/i"^ kpc) pairs in the GallCS 
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Figure 4. Angular correlation functions w{8) for the eleven flux-limited samples compared with the GallCS simulations. The shaded regions are the one-sigma 
eiTor bounds on w{9) from the mock catalogues; the data points correspond to Fig. [2] Both the data and the simulation results have been corrected for their 
integral constraints. 



simulations, corresponding to the scale of galax y groups and 
smaller, was also observed by iBlaizot et alj 1 120061) in their com- 
parison of the GallCS angular correlation funct ions with those of 
the Sloan Digital Sky Survey JYork et al.ll200d) . This can be ex- 
plain ed within the context of Ha lo Occupation Distribution models 
(e.g., lBerlind & Weinbera2002n . Such models have shown that the 
galaxy correlation function can be decomposed into two terms: (i) 
correlations between galaxies in different halos (large scales), and 
(ii) correlations bet ween pairs of galaxie s located within the same 
halo (small scales). Isiaizot et all ( |2006|) showed that the GallCS 



simulations underestimate this clustering of galaxies within a sin- 
gle halo, and this leads to the turnover of the GallCS correlation 
function at small scales, as we see in Fig.|4] 

There is a second factor which may contibute to the underesti- 
mate of small-scale clustering in the simulation results. The galaxy 
mass resolution limit of GallCS corresponds to a limiting galaxy lu- 
minosity (Mk < —22.7 in the restframe A'-band) and an absence 
of low-mass/low-luminosity galaxies in the simulations would also 
reduce the number of close pairs. A luminous galaxy is far more 
likely to have a low-luminosity companion, simply because their 
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number density is that much greater, and if these low-luminosity 
galaxies are missing from the simulation, then there will be an ab- 
sence of close pairs. As discussed below (Section [7](, the fainter 
samples contain proportionately more low-luminosity galaxies than 
the bright samples, and if these are missing from the GallCS mod- 
els due to the mass resolution limit, then that could also explain the 
trend for the models to over-predict the clustering at 4 and 6 nJy. 

Overall, the GallCS simulations are in excellent agreement 
with the angular clustering of 3.6-|J.m selected galaxies in SWIRE. 
However, the failure to resolve low-mass sources and to correctly 
predict the clustering within a given halo, indicate that these data 
are pushing the GallCS models to their limit. 



7 SPATIAL CLUSTERING 

7.1 Redshift distributions 

The angular correlation function is the projection along the line of 
sight of the spatial correlation function. We have used the inverse 
of Limber's equation (Eq.[5j to estimate the strength of the spatial 
clustering, expressed as the correlation length, ro, from our mea- 
surements of the angular clustering amplitude, A. Limber's equa- 
tion is expressed in terms of the redshift distribution of the sample, 
dN/dz, which incorporates the radial selection function, i.e. the 
probability that a source at a given redshift could have been de- 
tected in the survey. We have used four independent estimates of 
the redshift distribution to calculate the spatial correlation lengths 
for each SWIRE sample. 

The first two distribu tions were those pred icted from the phe- 
nomenological models of IXu et al.l ( 120030 and iFranceschini et al.l 
( 120061) . These models combine local luminosity functions with 
parametric modeling of luminosity and/or density evolution to pre- 
dict the relative numbers of different galaxy populations as a func- 
tion of redshift. Several populations are defined (for example, early- 
type, late-type, starburst, AGN), with spectral energy distributions 
drawn from observed or model template libraries, giving multi- 
wavelength predictions for the evolution of the galaxy populations. 
These models fit a wide range of observational data, in particular 
the mid-infrared number counts from Spitzer suiwey s. 

The GallCS simulations ( iHatton et aT] 120031) provided the 
third redshift distribution, and the fourth estimate of dN/dz was 
based on the SWIR E survey directly, using the IMPZ photometric 
redsh ift catalogue (Babbedge et al.''2004: iRowan-Robinson et al.l 
120051) . The redshift distributions are shown in Fig.|5]and their me- 
dian redshifts are listed in Table |2] The median redshifts are in 
good agreement with each other for flux limits of S^e = 25 (xJy 
and brighter, corresponding to median redshifts of (z) < 0.7- 
0.8. Fainter than this, the model distributions continue to shift to 
higher redshift with decreasing flux limit, but the median redshift 
of the observational estimate (iMPZ) remains constant. This pri- 
marily reflects the incompleteness of the optical identifications of 
the SWIRE survey, as the photometric redshift code is driven by 
the optical data. We find that only 40 per cent of the 4 fxly sample 
have photometric redshifts compared with > 90 per cent of sources 
brighter than 40 viJy. The sources without redshifts are those that 
are optically faint and so are more likely to be galaxies at high red- 
shift. 

Also plotted in Fig. \5\ (sh aded histograms) are the redshift 
distri butions of the K20 survey iCimatti et S]|2002l : lMignoli et all 
120050 . The K20 survey is a near-infrared (Kg < 20, Vega sys- 
tem) redshift survey of 545 sources, with a high spectroscopic 
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Figure 6. The comoving correlation length, rp, calculated from the SWIRE 
clustering amplitudes, as a fu nction o f median redshift. deri ved from eac h 
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dashed lines are the stable clustering (e = 0) pai'ametric models for dif- 
ferent normalizations at z = 0. The dotted line is the best-fitting epsilon 
model at z > 0.45, with e = 4.8 ± 0.8 and rg = 27 ± 6h~^ Mpc. 



completeness of 92 per cent. Using the equivalent A"-band flux 
limits of the SWIRE samples (section [Sj, we compared our esti- 
mates of the SWIRE redshift distributions with the observed K20 
data. The spectroscopic redshift distributions are consistent with 
the SWIRE estimates, however they do not distinguish between the 
different models or the iMPZ photometric redshifts - there are sim- 
ply too few sources with spectroscopic redshifts, particularly in the 
brighter samples. One can also see a peak in the K20 distributions 
at 2; ~ 0.7, corresponding to a cluster or other large-scale structure 
in the K20 survey. 

This illustrates how the small size of current A'-band selected 
redshift surveys (typically less than 1000 sources) restricts their 
usefulness in defining a redshift distribution, due to small-number 
statistics and cosmic variance. This, together with significant selec- 
tion biases (typically they target high-redshift or very red sources), 
makes them unsuitable as redshift distributions for calculating the 
inversion of Limber's equation, hence we use the photometric and 
model estimates in the following sections. 



7.2 Spatial correlation lengths 

Taking each of the redshift distributions in turn, we have used the 
inverse of Limber's equation (Eq. [Sj to calculate the correlation 
length, ro, from the angular clustering amplitude of each of the 
samples. Setting e = 7--3 = — 1.2 inEq.|2]forthecase of comov- 
ing clustering gives a comoving value of r o . The correlation lengths 
are given in Table|2]and are plotted as a function of median redshift 
in Fig. [6] for each dN/dz (the plot excludes the iMPZ results for 
the faintest four samples, which are incomplete). 

For the five brightest samples, corresponding to (z) < 0.5, 
the correlation length varies slowly with redshift for each of the 
dN/dz distributions, but with an apparent dichotomy between the 
phenomological models with ro — 8/i~^ Mpc and the iMPZ and 
GallCS estimates with ro — Qh~^ Mpc. The uncertainty in the 
correlation length is dominated by this scatter between the dN/dz 
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estimates, not by the statistical uncertainty in measuring the angular 
clustering. 

For all dN/dz distributions, the correlation length decreases 
rapidly with increasing redshift in the fainter samples (Fig. ^, 
falling from m ~ 6-8/i"^ Mpc at z ~ 0.5 to 3/i"^ Mpc at 2 ~ 1. 
Although the scatter in the median redshifts of the distributions are 
larger in these fainter samples, the average correlation lengths are 
more tightly constrained than than those at lower redshift. (This 
is easily understood: the scatter between the median values of each 
redshift distribution is small compared with the actual widths of the 



distributions, but at lower redshifts this is not the case.) If this high- 
redshift {z > 0.45) evolution is fitted by an epsilon model (Eq.O, 
we get best-fitting values of ro = 27±6/i^^ Mpc and e = 4.8±0.8 
(Fig. [6] dotted line). This is very strong evolution, and implies that 
these sources are in environments that would evolve into massive 
clusters by 2: = (c.f. Fig.[9]l. Although we do not consider the 
epsilon model to be a realistic model of evolution to the present 
day, it does give some quantitive indication of the rapid change in 
clustering seen at these redshifts. 

The spatial correlation lengths for the GallCS simulated cat- 
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Table 4. Average 3.6 \xm luminosities, login(J'3. 6/^w). and redshifts, 
(z), for the Xu et al. (2003) and France schini et al.1 QQ 
distributions. 536 are tlie flux limits (iJjy). 
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Xu et al. 

logioC-f'S.e/i©) 



Franceschini et al. 

(^> logioC-^s.e/i©) 



4.0 


0.88 


9.45 ± 0.48 


0.94 


9.24 ±0.55 


6.3 


0.84 


9.50 ±0.45 


0.88 


9.29 ±0.52 


10.0 


0.78 


9.57 ±0.42 


0.82 


9.39 ±0.50 


15.9 


0.74 


9.64 ±0.40 


0.74 


9.44 ± 0.48 


25.2 


0.70 


9.72 ±0.39 


0.68 


9.54 ± 0.45 


40.0 


0.66 


9.80 ±0.39 
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9.64 ± 0.42 
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0.18 


9.64 ±0.50 



alogues have been calculated in exactly the same way as those for 
the SWIRE observations. The amplitudes of the angular clustering 
(section|6j plus the redshift distributions (section lTTt of the GallCS 
samples were used to calculate ro through the inverse of Limber's 
equation (Eq.[5]l. These correlation lengths are also plotted in Fig.|6] 
(solid line, with the one-sigma errors in grey) for comparison with 
the results inferred from the observations. The GallCS simulations 
closely follow a stable clustering model, with a present-day corre- 
lation length of ro — 6h~^ Mpc. As expected from the good agree- 
ment between the angular correlation functions of the SWIRE data 
and the GallCS simulations, the spatial correlation lengths from 
GallCS (grey region in Fig.|6]l are consistent with those of SWIRE 
based on the GallCS redshift distributions (squares). 

In Fig. |6] we see that the correlation lengths derived from 
the GallCS dN/dz (both the SWIRE values and the simula- 
tions) differ mark e dly fr om t he results based on the models of 
iFranceschini et alj ilOOdt) and IXu et alj l l2003h . The latter predict 
that the SWIRE results are due to stronger clustering at low me- 
dian redshifts, with rapid evolution at 2; > 0.5, which is in con- 
trast to the slowly evolving stable clustering derived from the Gal- 
lCS simulations. The results based on the IMPZ photometric red- 
shifts generally follow the GallCS data but are noticably higher at 
z ~ 0.5, suggesting more complex evolution than either the mod- 
els or simulations predict. Given that the iMPZ redshift distribu- 
tions are based on empirical data rather than models, we consider 
these results to be the best estimate of the true correlation lengths, 
adopting the GallCS results for samples fainter than Ssa — 25 (xJy 
where the iMPZ data are incomplete. These correlation lengths are 
reproduced in Table[T]alongside the angular measurements. 

So far we have interpreted the change in ro as evolution in 
the clustering strength, but another possibility is that we are look- 
ing at different populations of sources in the different samples. For 
example, the fainter samples may be dominated by less massive 
galaxies which are located in halos of lower mass and so are in- 
trinsically less clustered (e.g., iLovedav et al.lll995l . iNorberg et al.l 
l2002l : IZehavi et alJuOOST) . We have investigated this possibility us- 
ing the 3.6-^im luminosity (Ls.e) as a proxy for stellar mass. In 
Fig. |7] we plot the luminosity distributions of each sample from 
the [Franceschini et al. ( 2006) and Xu et al. ( 2003) models, show- 
ing the contribution from early-type (elliptical/lenticular) and late- 
type (spiral, irregular, starburst) populations separately. For 3.6- (xm 
luminosities below 10^^ L0, the two models predict very simi- 
lar distributions for the total counts, but at higher luminosities the 
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Figure 8. Correlation lengths, ro, as a function of 3.6 nm luminosity, 
login(.L 3 .6/^fl ). derived from the [Franceschini et al.l J2006l stars) and 
IXu et alj <2003l circles) model redshift distributions. The symbol size is 
proportional to the median redshift of each sample. Luminosity errors are 
the standard deviations of the luminosity distributions. 



IXu et al.l ( l2003h models have an exten ded tail of luminous late- 
type galaxies that are not present in the IFranceschini et alj ( 120060 
models. At bright flux limits (Sae ^ 40 viJy), the late-types are 
about 30-40 per cent of the total number of galaxies. The fraction 
of late-types then increases with decreasing flux limit, until in the 
faintest sample the late-types are 50 per cent of the total accord- 
ing to_thej2Cu_et_alJj2003) model and 75 per cent of the total in 
the IFranceschini et alj C2006.) model. In all cases, the redshift dis- 
tributions of the early-type galaxies are weighted towards higher 
redshifts than those of the late-type galaxies. 

In Fig. [8] we plot the spatial correlation lengths against the 
average 3.6-^m luminosities for the two models, and tabulate the 
luminosities in Tab.|4| For the brightest samples, i.e. the lowest me- 
dian redshifts, the average luminosity is approximately constant, at 
log^Q(L3.6/L0) ~ 9.8. At fainter flux limits (5*36 ~ 40 jily and 
below), it is seen that there is a trend for ro to decrease with de- 
creasing average luminosity. Recalling that there is a progressively 
larger fraction of late-type galaxies as the flux limit decreases, both 
the decrease in average luminosity (mass) and the weaker cluster- 
ing could be a consequence of the changing galaxy population - i.e. 
an increase in the number of lower-mass late-type galaxies. How- 
ever, with the data we have available we cannot unambiguously 
disentangle evolution in the clustering strength from evolution in 
the relative populations of early-types and late-types. 

Finally, we have compared the best estimates of the correla- 
tion lengths (Table [Tjl with values of ro from the literature. This 
is shown in Fig. |9| where we plot ro as a function of redshift for 
our data, together with a variety of galaxy and AGN correlation 
lengths, and several models, compiled by Farrah et al. (2006). The 
SWIRE sources are the most clustered population of galaxies at 
z < 1, with only clusters of galaxies exceeding the correlation 
le ngths of the SWIRE sam ples. The solid lines are the halo models 
of Matarr ese et alj d 19970 which allow us to estimate the approxi- 
mate masses of the halos that host the 3.6-nm selected sources. At 
z < 0.5 the inferred halo masses are approximately constant with 
redshift with log(MHaio/M0) ~ 13.5. The correlation lengths 
have been shown above to decrease with increasing redshift be- 
yond z — 0.5 and this corresponds to a decrease in halo mass from 
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Figure 7. The distribution of 3.6 |am luminosities for each of the eleven flux-limited samples from the lpranceschini et alj I200d green) and lXu et alj 420031 red) 
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total number of sources. 



10"'^ Mq to 10^^ M0 at 2: = 1. This is consistent with the idea 
that we are detecting more late-type galaxies, which have lower 
masses than the early-types that are prevalent in the brighter sam- 
ples. 



8 CONCLUSIONS 

The SWIRE survey has allowed us to measure the angular cluster- 
ing of sources selected at 3.6 (xm over larger scales and with greater 



significance than previously has been possible. The two-point an- 
gular correlation function has been measured in eleven flux-limited 
samples down to Ssa = 4 fi.Jy, corresponding to median redshifts 
2; ^ 1. These angular results are in good agreement with the Gal- 
ICS simulations and with K-band clustering measurements. 

We have used Limber's equation to infer the spatial correla- 
tion lengths from the angular measurements and four estimates of 
the redshift distributions of each sample. We find that the system- 
atic uncertainty in the dN/dz distribution dominates the statistical 
errors, and adopt the photometric redshift distribution from IMPZ 
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Figure 9. The best estimate of the comoving correlation length, rg, of our 
samples (red circles) as a functi on of redshift, compared with values from 
the literature JFan'ah et al JlOOfl and references therein). The solid lines are 
the models of lMatan'ese et alj (19921) for halos of fixed mass, with masses 
as indicated. Also shown are indicative e models of clustering evolution: 
stable and linear models with rg = 7h~^ Mpc, corresponding to the aver- 
age of our low-z results; and a linear m odel normalized to th e dark matter 
clustering strength, rg = 5h~^ Mpc, of lJenkins et alj 09981) . 



as the best estimate of dN/dz, extrapolating to higher redshifts 
using the GallCS results. The comoving correlation length varies 
slowly around ro = 6.1 ± 0.5/i^^ Mpc out to a median redshift of 
z = 0.5 and then decreases with increasing redshift. If this change 
at higher redshifts is due to evolution in the clustering, then the re- 
quired evolution is very strong (with ro = 27 ± 6/i^^ Mpc and 
e = 4.8 ± 0.8) and these SWIRE galaxies would be in environ- 
ments that will evolve into massi ve clusters by the present day. 
There is some indication from the Ipranceschini et alj ( 120061) and 



IXu e t al. ( 2003) phenomological models that the change in corre- 
lation length is due to a decrease in the average 3.6-vim luminos- 
ity (or equivalently, stellar mass) of the fainter samples rather than 
evolution in the clustering. This decrease in average luminosity is 
the result of an increased fraction of late-type galaxies in the faint 
samples. Comparing t hese SWIRE data with the halo models of 
iMatarrese et al.l ( 119971) suggests that the fainter samples are select- 
ing sources in lower mass halos. These latter two results provide a 
consistent picture where lower-mass late-type galaxies are prefer- 
entially found in lower mass halos and thus have weaker clustering. 
The comparison of our data with the GallCS simulations has 
shown that these models are a good match to the angular clustering, 
but their redshift distributions differ markedly from the phenomo- 
logical models and the photometric redshift distributions. This 
highlights the need for a better understanding of the spatial clus- 
tering and its evolution. Higher-resolution numerical models and 
improved semi-analytic models will allow us to directly explore the 
relation between the clustering of the galaxies and the dark matter, 
i.e. the bias. Fo r example, the dark matter Millenium Simulation 
dSpringel et al.r 2005) has a lower mass resolution limit than Gal- 
lCS, and combined with semi-analytic models of the galaxies it is a 
better match to the area and depth of SWIRE than the current mod- 
els. Future work will explore the comparison between these models 
and the SWIRE data presented here. 
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